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Abstract. A family of replicator-like dynamics, called the escort 
replicator equation, is constructed using information-geometric con- 
cepts and generalized information entropies and diverenges from 
statistical thermodynamics. Lyapunov functions and escort gener- 
alizations of basic concepts and constructions in evolutionary game 
theory are given, such as an escorted Fisher's Fundamental theo- 
rem and generalizations of the Shahshahani geometry. 



1. Introduction 

Recent interest in generalized entropies and approaches to informa- 
tion theory include evolutionary algorithms using new selection mech- 
anisms, such as generalized exponential distributions in place of Boltz- 
mann distributions [T0|[lT] and the use of generalized logarithms and 
information measures in dynamical systems [19], statistical physics [29] . 
information theory [6] , [201 - 122] , and complex network theory [30] . This 
paper brings these generalizations to evolutionary dynamics, yielding 
generalizations of the replicator equation and the orthogonal projec- 
tion dynamic, deeping the connection of the replicator equation and 
the relationship with information theory. 

Information geometry yields fundamental connections between evo- 
lutionary stability and information theory, generating the replicator 
equation from information divergences [14] . Statistical thermodynam- 
ics describes generalized information divergences, such as the Tsallis 
divergence, using escort functions [22J. Lyapunov functions are con- 
structed from generalized information divergences. Generalizations of 
Fisher's fundamental theorem and Kimura's maximal principle are for- 
mulated. 

The motivation for these constructions comes from the fact that 
while Fisher information and the Kullback-Liebler divergences have 
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nice uniqueness properties, they are not the only way to measure infor- 
mation. Many other divergences have been studied in other contexts 
and a natural question is to ask what differences arise in models of 
natural selection that measure information differently. Using alterna- 
tive logarithms allows the generalization to a large class of divergences 
and produces new evolutionary dynamics following the framework es- 
tablished in (14] . 



This section describes the necessary functions and definitions to de- 
fine the escort replicator equation. The following definitions are due 
to or adapted from contemporary work in statistical thermodynamics 
and information geometry; see [21] and [I]. 

Assume a given function 0, called an escort, that is strictly positive 
on (0, 1). An escort induces a mapping of the set of categorical prob- 
ability distributions (the simplex) into itself by applying the escort to 
each coordinate and normalizing; a similar map is induced from Wf to 
A". A common escort is 4>(x) = x q , which yields constructions of the 
Tsallis and Renyi type. This is a g-deformation as the limit q — > 1 
recovers the distribution. 

Throughout, a function of a single variable applied to a vector is to 
be interpreted as applying to each coordinate, i.e. if x = (xi, . . . ,x n ) 
then log(x) = (\og(xi ),..., log(x n )). This abuse of notation will be 
clear from context and avoids excessive notation. 

Definition 1 (Partition Function, Escort Distribution). For a distri- 
bution X — (^X ]_ - X^ i ' ' ' i 

), define the partition function 



The escort function has various effects depending on its form. In the 
case of a constant escort, all distributions are mapped to the uniform 
distribution and the mean is just the average. The induced endomor- 
phism on the set of discrete probability distributions is terminal, map- 
ping every point of the simplex to a single point, the barycenter. If the 
escort function does not have the property that 0(0) = 0, such as for 
4>{x) = e x , then the escort changes an event with probability X, = to 
nonzero probability, affecting the computation of the mean if the escort 



2. Statistical Definitions 
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distribution is used. For such an escort, The induced endomorphism 
maps the boundary of the simplex into the interior of the simplex. 

Notation for a generalized mean is convenient. The escort expecta- 
tion is the expectation taken with respect to the escort distribution in 
place of the original distribution. 

Definition 2 (Escort Expectation). For a distribution x = (xi, X2, ■ ■ ■ , x n ) 
and a vector f define the escort expectation 



Escort information divergences are defined by generalizing the nat- 
ural logarithm using an escort function. 

Definition 3 (Escort Logarithm). Define the escort logarithm 



For example, for the function 4>(x) = x q , the escort logarithm is 



The limit q — y 1 recovers the natural logarithm. A generalization of 
the logistic map of dynamical systems using this logarithm is given in 
[19] . This logarithm is used in Tsallis statistics, a generalized approach 
statistical thermodynamics, and has recently been used in complex 
network analysis. 

The escort logarithm shares several properties with the natural log- 
arithm. For instance, the escort logarithm is negative on (0, 1) and 
positive on (l,oo); it is concave if <fi is strictly increasing. Define the 
function exp^, to be the inverse function of log^. 

Definition 4 (Escort Divergences). Define the escort divergence (or 
^-divergence) 



The identity function 4>{x) = x generates the usual logarithm and 
exponential with the Kullback-Liebler divergence. Setting <p{x) = x q 
generates divergences similar to Tsallis and Renyi divergences, and the 
a-divergence of information geometry [HE]. 
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3. Geometry and Dynamics 
Define the escort metric (or 0-Fisher information metric) 

1 



<f)(Xi 



-5 



on the simplex. This is a Riemannian metric since the escort <fi is 
strictly positive and so the metric is positive definite. The metric may 
be obtained as the Hessian of the escort divergence. The identity func- 
tion <p(x) = x generates the Fisher information metric, also known 
as the Shahshahani metric in evolutionary game theory. Denote the 
simplex with the escort metric as the escort manifold. 

The Shahshahani metric pulls back to the Fisher information metric 





9iA x ) 



E 



d\ogpd\ogp 



dx % dxi 

The escort metric can be obtained from a Fisher-like formula [23] 

'dlogpdhg^p' 



In coordinates, 

9ij{ x ) 



gij(x) = E 



Ex k —S, 



dx l dxi 



.!■/, ^ * (j){x k ) 



fc.3 



4>{xi 



A symmetric form that yields the escort metric is 

" d log, pd log. p 



9lj ( P ) = z^xM 



dxi 



d Xj 



which also reduces to the Fisher information metric if <f>(x) = x. 

The transformation x h- )■ l^fx transforms the Shahshahani manifold 
into the positive portion of the radius 2 n-sphere with the Euclidean 
metric [2]. The analogous transformation to Euclidean space for the 

escort met ,c is given by the ^derivative ! 1 



dx direct 



.r 



computation of the Jacobian shows. The special case of 



[x 



x gives 



Akin's result and a geodesic genomic distance d(p, q) = 2 arccos 



In principle, these transformations allow for the derivation of general- 
ized measures of genomic distance. 
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It is known that the replicator equation is the gradient flow of the 
Shahshahani manifold, with the right hand side of the equation a gra- 
dient with respect to the Shahshahani metric if the fitness landscape 
is a Euclidean metric [15j|26]. This generalizes to the escort manifold. 

Proposition 1. Let f$ ti = 4>{ x d {fi( x ) ~ ^tlfi x )])>' f<t> ^ s a gradient 
with respect to the ^-metric if f is a Euclidean gradient. 

Proof. For notational convenience we denote by /j for this proof. 
Note first that for f(x) to be in the tangent space of the n-simplex 
requires that £\ /^(x) = 0. 

= Z>(*i) !/(*)]) 

i i 

i i 

= ^2<i>( x t)fi( x ) -^2^ x j)M x ) = ° 

» 3 

To see that / is a gradient with respect to the escort metric, recognize 
that the gradient is defined uniquely by the relation < grad/, z > x — 
D x V(z), where / is the euclidean gradient of V. 

< f,z >x = Y] -^—fi(x)zi 



E^y^) (M x )-®i [/(*)])* 
j2fi(x)zi-J2 z i{ E i [/(*)]) 

i i 

E/*(^- (l>) K !/(*)]) 

E •M 3 ^ = E = D * v ( z ) 



□ 



Define the escort replicator equation as 

(1) ii = 0(si)(/<(a:)-!g [/(x)]) 

Like the replicator equation, it is the gradient flow only when the fitness 
landscape is a Euclidean gradient. 
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The escort dynamic can be arrived at by deforming the replicator 

equation: noting that — log, Xi = f 1 . , rewrite the escort dynamic as 

at v 4>{Xi) 

^0^x i )=Mx)-E* [f(x)\, 
which is a deformation of the replicator dynamic, 

±(\ogx i ) = f i (x)-E x [f(x)]. 

Remark 1. The escort dynamic shares many properties with the repli- 
cator dynamic. For instance, a Nash equilibrium of the fitness land- 
scape f is a rest point of the escort dynamic. Indeed, if x is a Nash 
equilibrium, there is a constant c such f(x)i = c for all i G supp(x), 
from which it follows that Ef. [/(£)] = c. This gives a rest point of 

the dynamic since Xi = 4>{xi) (ji(x) — Ef. [f(x)]j = (f>(£i)(c — c) = 0. 

Another method to understand that Nash equilibria are stationary is 
that the escort dynamic is an orthogonal projection with respect to the 
escort metric. Since the orthogonal complement to the tangent space of 
any point on the interior of the simplex is the span of (1,1, ... ,1), a 
Nash equilibrium is already orthogonal. 

4. Generalizations of Fundamental Results 

A generalized version of Kimura's Maximal Principle follows imme- 
diately from the fact that the escort dynamic is a gradient. A simple 
calculation gives an analog of Fisher's Fundamental Theorem of Natu- 
ral Selection. 

Theorem 2 (Escorted Fisher's Fundamental Theorem of Natural Se- 
lection) . 

±V(x) = Z+(x)Var*\f(x)] 

Proof. 

n 

V(x) = D x V(x) =< U(x), U(x) >= -r-M^)(Mx) - Et [f(x)})} 
= Z*(x)Ei - Et [f(x)]f] = Z,(x)Vari[f(x)} 

□ 

The Kullback-Liebler divergence gives a Lyapunov function for the 
replicator equation, a result which can now be generalized. 
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Theorem 3. A state x is an ESS if and only if the escort divergence 
D^ipbWx) is a Lyapunov function for the escort dynamic. 

Proof. Let V(x) = D^(x\\x). For non- decreasing 0, V is positive, and 
zero if and only if x — x. 

Taking the time derivative gives: 

v = - y2(xi-Xi)-J—Xi 

= -'£(£ i -x i )(f i (x)-E* [/(*)]) 

i 

— — (x — x) ■ f(x) < 0, 

where the last inequality is true if and only if the state x is an ESS. □ 

Remark 2. The escort dynamic is a special case of the construction of 
the adaptive dynamic. This immediately gives a local Lyapunov func- 
tion from the geodesic approximation fTh^ . A second is given in the case 
of an escort gradient, if the landscape is a Euclidean gradient. A third 
is given above in terms of the dual divergence in the preceding theorem, 
which is global and independent of the form of the fitness landscape. 

5. Examples 

5.1. Replicator dynamic. Let (f)(x) = x be the identity function. 
The log^ and escort distributions are then just the ordinary logarithm 
and distribution, and the escort replicator equation is the replicator 
dynamic. The induced metric is the Shahshahani metric, and the in- 
duced divergence is the Kullback-Liebler divergence. For a detailed 
exposition of this case see [H]. 

5.2. Replicator dynamic with selection intensity. Let 4>(x) = fix, 

where /3 > is the inverse temperature. The induced escort mapping 
is the identity because the parameter (3 cancels. The escort logarithm 
is -s log x and the induced escort dynamic is 

Xi = I3xi(fi(x) - f(x)), 

a form in which special cases have been derived using Fermi selection 
from stochastic differential equations (with parameter 0/2) [23]. The 
parameter can be interpreted as the intensity of selection. It affects 
the velocity of selection but not the trajectories. This is related to the 
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fact that the Fisher information metric is the unique metric, up to a 
constant multiple, on the simplex that respects sufficient statistics [7]. 

In general, given any escort function tjj, a new escort <p(x) = j3ip(x) 
induces an escort dynamic with a leading intensity of selection factor. 
The trajectories are the same up to a change in velocity. 

5.3. F-divergences. Let F be a convex function on (0, oo) and con- 
sider the F-divergence 



The F- divergence, when localized, produces a metric of the form gij(x) = 
F"(l)—8ij, so these divergences yield dynamics that are special cases 
of the previous example [I]. 

This example also informs the definition the escort divergence. The 
Kullback-Liebler divergence is often written as 



This definition would fall into the case of an F-divergence and so 
not produce interesting new dynamics, and follows from the fact that 
^og^xy) ^ log x + log^ y in general. 

5.4. g-deformed replicator dynamic. Let <f)(x) = x q . This gener- 
ates the g-metric x~ q 8ij and the g-deformed replicator dynamic 



The limiting case q — >■ 1 yields the replicator equation, as is usual for 
g-deformations. The g-deformed replicator dynamic is formally similar, 
but not identical, to the dynamic generated by the geometry of escort 
parameters, viewing q as parameter, derived in pp. For detail regard- 
ing the escort logarithms and Tsallis entropy see [IB]. For a detailed 
exposition of the information geometry in this case see [23]. The case 
q = is the orthogonal projection dynamic [T7] . 





which suggests the escort divergence take the form 
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5.4.1. Poincare dynamic. Consider the case <f)(x) = x 2 . This generates 
the Poincare metric [TB], and an entropy similar to Simpson diversity. 
Call the associated dynamic the Poincare dynamic. 

Let us consider the following explicit example, adapted from |15j . For 
the replicator equation with a fitness landscape given by a matrix A, 
so that f(x) = Ax, the game matrix A is called zero sum if a. 
This yields a mean fitness f(x) 
the rock-scissors-paper matrix 



x ■ Ax = 0. For instance, consider 



.4 



1 -1 
-1 1 

1 -1 



This gives a zero-sum game with fitness landscape f(x) = (x 2 — x 3 , x 3 — 
Xi,Xi — x 2 ) an d internal rest point x = (1/3, 1/3, 1/3). The dynamic 
has an integral of motion given by X1X2X3, or equivalently, an integral 
of motion £\ Xi log Xj. 

For the Poincare dynamic, it is not the case that [f(x)] = 0, but 
the quadratic fitness landscape /(x) = A(f)(x) = [x\—x\,x\~x\,x\—x\] 
does have zero escort mean. This property is an analog of zero-sum, 
having a similar effect on the Poincare dynamic which now takes the 
form Xi = xffi(x). The state x = (1/3,1/3,1/3) is still an interior 

equilibrium, and using the fact that log^x) = 1 , we have an 

integral of motion for the Poincare dynamic given by ^ x^ log^ Xj = 
|(3 — -), or equivalently, — + — + —. Observe that 

d v X\ X2 X3 > 1 ^ J ' Xi X2 X'i 



d 

dt 



1 

Xl 



1 

X2 



1 



Xi X2 



X 1 



■1 1 



.1 •- 



ifi(x)+f2(x)+fs{x)) = 0. 



The example easily generalizes to longer cycles and arbitrary escort 
functions, yielding a constant of motion Xi log^ Xj in general. 



5.5. Orthogonal Projection Dynamic. Let <j>(x) = 1, which gen- 
erates the Euclidean metric. The resulting dynamic is the orthogonal 
projection dynamic, 

1 n 

Xi = fi{x) ^2fi(x), 

i=0 

where the expectation is the average. The Lyapunov function takes the 
form ^\\x — x\\ 2 , which captures the known result. [25] 



10 



MARC HARPER 



5.6. Exponential Escort Dynamic. Another example of a dynamic 
that is not forward- invariant is the dynamic generated by <f>(x) = e x : 



For an explicit example, consider the fitness landscape f(x) = e x = 
(e~ Xl , . . . , e~ Xn ). After some simplification, the dynamic takes the form 



the right-hand side of which is clearly nonzero for Xi = 0, unless Xj = 
for all j, which is impossible because of the constraint Y^j x j = 1- The 
barycenter (^, . . . , ^) is an interior rest point. Note that the escort 
geometry in this case does not diverge on the boundary. 

5.7. Remarks. Notice that the escort replicator equation is forward- 
invariant on the simplex, in the case of a non-constant fitness landscape, 
if and only if 0(0) = 0, which is the case for the g-deformed replica- 
tor dynamic if q > 0, but not the case for the orthogonal projection 
dynamic, though the geometry is trivial. The exponential escort dy- 
namic yields an example that is not forward-invariant with a non-trivial 
geometry. 

The escort dynamic satisfies a gauge invariance similar to the repli- 
cator equation. The orthogonal complement to the tangent space of 
the simplex is the line generated by the vector 1 = (1,1,...,1). Any 
function g(x)l mapping into the complement can be added to the fit- 
ness landscape without affecting the trajectories of the escort dynamic 
because the quantity f\(x) — [/(#)] is preserved: fi(x) + g(x) — 



¥,t [f(x)+g(x)l] = f i (x)+g(x)-(Et [f(x)]+g(x)) = / 4 (x)-E* [f(x)\. 



In some cases, the escort replicator equation can be transformed 
into the replicator equation with an altered fitness landscape. If <fi 
is invertible and differentiate on [0,1], with nonzero derivative, the 
transformation y = <f>(x) translates the escort replicator equation to 



Vi = MxiMxi) (/Ms)) - K [f{<t>{x))\) = #(xi)vi (My) - [f(y)}) , 



so that for the fitness landscape g(y) = f((f>(x)), we have that 



If (f)'((j) is strictly positive and does not depend on i then the last 

equation can be transformed into the replicator equation 




m = 0'O 1 {yi))yi{g{y)-g{y))- 



v% = Vi (g(y) - g(y)) , 
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by a strictly monotonic change in time scale. An example is 4>{x) = 
(3x, which was shown above to produce a dynamic equivalent to the 
replicator equation. 

To determine if the dynamics generated by <fi{x) and ip{y) are distinct 
with respect to change of velocity, consider the equations 

x i = <j)(x i )(f i (x)-Et [f(x)]) and 

in = <Kvi) -Ef [f(y)\). 

Assuming that ip is differentiable with different iable invertible and that 
is differentiable, the transformation 4>(xi) = ipiUi) yields the equation 

ili = ty- 1 )\<t>(x i ))<l>\x i )<i>(x i ) (Mx) - E* [f(x)\) 

If (<f)(xi))<j)' (xi) does not depend on i, the <fi dynamic is a ip 

dynamic (for an altered landscape) after a change in velocity. In the 
case of <j>(x) = x p and if)(y) = y 9 , this quantity becomes 

q 

which is independent of i if and only if p = q. 

5.8. Dynamics from Other Common Information Divergences. 

The escort dynamic is rather general in that it captures the localized 
form of many other commonly used information divergences. Bregman 
divergences are summations «) °f ^ ne i° Tm ^{u, v) = f(u)—f(v) — 
f'{v){u — v), which localize to <p(v,v) = f"(v), which correspondes to 

an escort given by the relation f"(v) = . Similarly for Burbea-Rao 

<j)[V) 

divergences, up to a multiplicative constant [8j . Straight-forward com- 
putations show that Jensen- Shannon divergences [18] and capicitory 
discrimination [T2|I28"] localize to (possibly a constant multiple of) the 
Shahshahani form as well. 

6. Solutions of the Escort Replicator Equation 

In terms of the geometry, exponential families are normal coordi- 
nates on the manifold. The generalized exponential furnishes a formal 
solution to the escort dynamic. Let v be a solution to V{ = fi(x); 
then Xi = exp^Vi — G) is a solution to the escort dynamic where 
G = [/(#)], which follows from the fact that x\ + ■ ■ • + x n = 0. 
The following fact regarding the derivative of the escort exponential is 
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needed for the proof of the solution (easily shown with implicit differ- 
entiation). 

Lemma 1. 

— exp^x = (f)(exp^x) 

Now observe that 

Xi = ^exp^Vi -G) = 0(exp (^-C7))(^-C7) = <p(xi)(fi(x)-Ki [f(x)]). 

The escort exponential solution is the maximal entropy distribution 
with respect to the generalized information divergence [22] . Geometri- 
cally, the exponentials correspond to the exponential map induced by 
the manifold structure. 

6.1. Escort AfRne Landscapes. In the case that the fitness land- 
scape is escort affine, i.e. f(exp^(x)) = Ax + b for a matrix A and 
vector b, an explicit solution can be given. We assume that 6 = 
for simplicity, referring to the fitness landscape as escort log-linear. 
Adapting the method of [5J, let f(x) = Alog^x) for some matrix A, 
and perform a gauge transformation on A so that the columns are or- 
thogonally projected on the tangent space of the simplex. Explicitly, 
the matrix A = (ay) becomes A c = (a^- — - Y2i a ij)i which is a valid 
gauge transformation as it amounts to adding a particular constant 
value to each column to ensure that the column sums to zero. The 
equation for the variable v becomes Vi = Avi — AG = A c Vi, using 
Xi = exp^(f i — G). This equation can be solved by eigenvalue methods 
and, if the equation for G can be solved, will yield an explicit closed 
form. 

7. Discrete Escort Replicator Equation 

An escort also gives an escorted discrete escort replicator dynamic. 
The escorted versions take the form 

, _ <j>(Xi)fi(x) 



K [/(*)] 

A particularly simple case is the discrete analog of the orthogonal 
projection dynamic 

, fi(x) 

which treats the fitness landscape as a non-normalized probability dis- 
tribution estimating the population distribution. 
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As for the replicator equation, the escort dynamic can be obtained 
by a limiting procedure from the discrete analog: 

n l> Et [f(x)} n l> 

<>(r ' ] ifi(x)-Ei[f(x)]). 



n [/(*)] 

After a change in velocity to remove the strictly positive scalar func- 
tion Ef [/(x)] from the denominator, the escort dynamic is obtained. 
Because of this correspondence, the escort inference equation is ex- 
pected to behave well with respect to the escort divergences and escort 
exponential families, in analogy of the relationship between Bayesian 
inference and the Kullback-Liebler divergences 



8. Vector- valued Escorts 

As a final example, notice that we can use a more general escort 
function that is a vector-valued function of the population distribution. 
For instance, consider the escort ip(x) = (e^ 1 ^'* 1 , . . . , e^™^). Then the 
above definitions still go through with the appropriate changes, e.g. 

S' 

the metric is now of the form g^ix) = — y— , which is positive definite, 

and the induced dynamic is 

x 4 = ^(x) (fi(x)-Et [f(x)}), 

which differs because of the dependence on i and the entire distribution 
in the leading term ip(x). The notational convention that ip(x) means 
i[) is applied to each coordinate can be dropped for a vector-valued ip 
to obtain the definitions for escort mean and related quantities. If ip is 
identical in each coordinate, the leading factor can be eliminated with 
a change of velocity, since the dependence on i is removed. (This is 
subtly different than for the escort dynamic, where <p(xi) has depen- 
dence on the distribution argument but not the functional argument.) 
For a concrete example, the escort ip(x) = (PiXi, . . . , /3 n x n ) yields a 
replicator-like dynamic with differential selection pressures on the in- 
dividual types. 

With general ip, care must be taken with the induced logarithms, 
as they differ for each coordinate and the defining integrals are now 
possibly vector-valued, and with the form of the information diver- 
gences. Nevertheless, many of the computations above hold with only 
minor changes (e.g. introduction of indicies for the escort), such as the 
generalization of Fisher's fundamental theorem. 
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9. Discussion 

Incorporating generalized information entropies from statistical ther- 
modynamics yields interesting new dynamics and defines a class of dy- 
namics that includes the replicator equation and the orthogonal pro- 
jection dynamic. Some of the dynamics have interesting combinations 
of features, such as non-forward-invariance, while retaining nice infor- 
matic properties. By using constructions from information geometry, 
Lyapunov functions can be given for the class of dynamics arising from 
escorts that yield convex logarithms. Analogs of Fisher's fundamental 
theorem and other results follow naturally, yielding new evolutionary 
models. 
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